[docs] [template] [data] Refactor current LLM Batch inference template #59897
base: master

Conversation
…full 2M

…of batch size, concurrency, more refs to docs links, refactor quantization and model parallelism section for more readability, add image validation, mention anyscale runtime, pin datasets version
/gemini review
nrghosh left a comment:
thanks @Aydin-ab - see also the Cursor/Gemini comments on the code. As long as you're able to run the examples successfully, they should be free of serious bugs now.
Code Review
The pull request introduces new LLM batch inference examples for both text and vision data, along with their corresponding CI configurations and helper scripts. The changes effectively split the existing template into two distinct, independently readable content pieces, which is a good refactoring step. The new examples demonstrate how to use Ray Data LLM APIs for batch inference, including data preparation, processor configuration, and scaling considerations. The addition of CI scripts ensures these examples remain functional.
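For reference, the pattern these examples exercise looks roughly like this (a minimal sketch based on the documented ray.data.llm vLLM processor API; the model, sampling parameters, and dataset below are placeholders, not the template's actual values):

```python
import ray
from ray.data.llm import vLLMEngineProcessorConfig, build_llm_processor

# Configure one vLLM engine replica; batch_size controls rows per batch.
config = vLLMEngineProcessorConfig(
    model_source="Qwen/Qwen2.5-0.5B-Instruct",  # placeholder model
    concurrency=1,
    batch_size=64,
)

# preprocess builds the chat messages; postprocess extracts the generation.
processor = build_llm_processor(
    config,
    preprocess=lambda row: dict(
        messages=[{"role": "user", "content": row["prompt"]}],
        sampling_params=dict(temperature=0.0, max_tokens=128),
    ),
    postprocess=lambda row: dict(answer=row["generated_text"], **row),
)

ds = ray.data.from_items([{"prompt": "What is Ray Data?"}])
ds = processor(ds)
ds.show(limit=1)
```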
However, there are a few areas that could be improved for robustness and clarity:
- The nb2py.py scripts use specific string matching to modify dataset limits for CI. This approach is brittle and could break if the exact string in the notebook changes (see the sketch after this list).
- Some comments in the Jupyter notebooks are slightly misleading regarding dataset size limits.
- The standalone Python scripts contain hardcoded configuration values that would ideally be configurable for real-world use cases.
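One hypothetical way to avoid the string-matching brittleness (an assumption on our part, not what the PR implements) is to have the example read its row limit from an environment variable that CI can override:

```python
import os

import ray

# Hypothetical knob: CI exports SAMPLE_LIMIT to shrink the run, while the
# default preserves the full-size run described in the notebook.
SAMPLE_LIMIT = int(os.environ.get("SAMPLE_LIMIT", "100000"))

ds = ray.data.range(1_000_000)  # placeholder for the template's real dataset
ds = ds.limit(SAMPLE_LIMIT)     # Dataset.limit caps the number of rows read
print(ds.count())
```

With this, nb2py.py wouldn't need to rewrite notebook source at all; CI would just set the variable.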
nrghosh left a comment:
thanks! minor comments but overall LGTM
- datasets version in the vision template (4.4.2) differs from job.yaml (4.4.1)
- put imports at the top of the Python files (not inline), for style
- potential image-mode issue in batch_inference_vision.py and batch_inference_vision_scaled.py: when you open images with PIL, there can be issues with modes. Something like the following could improve robustness:

```python
from io import BytesIO
from PIL import Image

image = Image.open(BytesIO(image))  # image starts as raw bytes here
if image.mode != 'RGB':             # e.g. palette or grayscale inputs
    image = image.convert('RGB')
```
- partition counts are hardcoded (64/128/256) but the explanation says "2-4x the worker (GPU) count" - so with concurrency=4 and 128 partitions, that's 32x the GPU count (see the sketch after this list)
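A sketch of deriving the partition count from the worker count instead of hardcoding it (hypothetical numbers; concurrency here stands in for the template's GPU worker count):

```python
import ray

concurrency = 4                      # number of GPU workers (placeholder)
num_partitions = concurrency * 4     # upper end of the stated 2-4x rule

ds = ray.data.range(10_000)          # placeholder for the real dataset
ds = ds.repartition(num_partitions)  # 16 blocks for 4 workers, not 128
```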
overall great - extensive use of the APIs, a clear and helpful distinction between batch and online inference, and well-written scaling guidance plus the bits on model parallelism
nrghosh left a comment:
approved with comments
Follow-up of this PR (closed for inactivity over the holidays):
#58571
There are a lot of files changed, but the main technical content to review is in the README.ipynb files.
For context, the goal is to refactor this current template:
https://console.anyscale.com/template-preview/llm_batch_inference
and split it into two: one on text data and the other on vision data.
Both are very similar, but each should be read independently as a distinct piece of content 👍